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Abstract — This paper considers a smart grid cyber-security 
problem analyzing the vulnerabilities of electric power networks 
to false data attacks. The analysis problem is related to a 
constrained cardinality minimization problem. The main result 
shows that an h relaxation technique provides an exact optimal 
solution to this cardinality minimization problem. The proposed 
result is based on a polyhedral combinatorics argument. It is 
different from well-known results based on mutual coherence 
and restricted isometry property. The results are illustrated on 
benchmarks including the IEEE 118-bus and 300-bus systems. 

Index Terms — Power network state estimation, security, oper- 
ation research, optimization methods. 



I. Introduction 

A modern society relies critically on the proper operation of 
the electric power distribution and transmission system, which 
is supervised and controlled through Supervisory Control 
And Data Acquisition (SCADA) systems. Through remote 
terminal units (RTUs), SCADA systems measure data such 
as transmission line power flows, bus power injections and 
part of the bus voltages, and send them to the state estimator 
to estimate the power network states (e.g., the bus voltage 
phase angles and bus voltage magnitudes). The estimated 
states are used for vital power network operations such as 
optimal power flow (OPF) dispatch and contingency analysis 
(CA) fl], El. See Fig. Q] for a block diagram of the above 
functionalities. Any malfunctioning of these operations can 
delay proper reactions in the control center, and lead to 
significant social and economical consequences such as the 
northeast US blackout of 2003. 

The technology and the use of the SCADA systems have 
evolved a lot since the 1970s when they were introduced. The 
SCADA systems now are interconnected to office LANs, and 
through them they are connected to the Internet. Hence, today 
there are more access points to the SCADA systems, and also 
more functionalities to tamper with. For example, the RTUs 
can be subjected to denial-of-service attacks (Al in Fig. [TJ. 
The communicated data can be subjected to false data attacks 
(A2). Furthermore, the SCADA master itself can be attacked 
(A3). This paper focuses on the cyber security issue related 
to false data attacks (A2), where the communicated metered 
measurements are subjected to additive data attacks. A false 
data attack can potentially lead to erroneous state estimates 
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Fig. 1: Block diagram of power network control center and 
SCADA. RTUs connected to the substations transmit and 
receive data from the control center using the SCADA system. 
At the control center, a state estimate is computed and then 
used by Energy Management Systems (EMS) to send out 
commands to the power network. The human figures indicate 
where a human is needed in the control loop. This paper 
considers the false data attack scenario in A2. 



by the state estimator, which can result in gross errors in OPF 
dispatch and CA. In turn, these can lead to disasters of signif- 
icant social and economical consequences. False data attack 
on communicated metered measurements has been considered 
in the literature (e.g., C3|-|10|). [3| was the first to point out 
that a coordinated intentional data attack can be staged without 
being detected by state estimation bad data detection (BDD) 
algorithm, which is a standard part of today's SCADA/EMS 
system. (H-Q, 1171-lfTO'l investigate the consttuction problem 
for such "unobservable" data attack, especially the sparse ones 
involving relatively few meters to compromise, under various 
assumptions of the network (e.g., DC power flow model HI, 
0). In particular, poses the attack construction problem as 
a cardinality minimization problem to find the sparsest attack 
including a given set of target measurements. JU, (5|, |8| set 
up similar optimization problems for the sparsest attack includ- 
ing a given measurement. (7), iflOl seek the sparsest nonzero 
attack and [9| finds the sparsest attack including exactly two 
injection measurements. The solution information of the above 
optimization problems can help network operators identify 
the vulnerabilities in the network and strategically assign 
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protection resources (e.g., encryption of meter measurements) 
to their best effect (e.g., 0, 0, JTOjl). On the other hand, 
the unobservable data attack problem has its connection to 
another vital EMS functionality, namely observability analysis 
0, ID- In particular, solving the attack construction problem 
can also solve an observability analysis problem (this is to be 
explained in Section Hl-Ct . This connection was first reported 
in Q, and was utilized in ifTTl to compute the sparsest critical 
p-tuples for some integer p. This is a generalization of critical 
measurements and critical sets 0. 

To perform the cyber-security analysis in a timely manner, 
it is important to solve the data attack construction problem 
efficiently. This effort has been discussed, for instance, in 0- 
0, ll7l- lfT0l . The efficient solution to the attack construction 
problem in [4] is the focus of this paper. The matching pursuit 
method iflZl employed in and the basis pursuit method 
|[T3l (Zi relaxation and its weighted variant) employed in iflOl 
are common efficient (i.e., polynomial-time) approaches to 
suboptimally solve the attack construction problem. However, 
these methods do not guarantee exact optimal solutions, and 
in some cases they might not be sufficient (see for instance 1 8 1 
for a naive application of basis pursuit and its consequences). 
While 0, iflOll provide polynomial-time solution procedures 
for their respective attack construction problems, the problems 
therein are different from the one in this paper. Furthermore, 
the considered problem in this paper cannot be solved as a 
special case of 0, IflOl . In particular, in [7| the attack vector 
contains at least one nonzero entry. However, this nonzero 
entry cannot be given a priori. [10] needs to restrict the number 
of nonzero injection measurements attacked, while there is 
no such requirement in the problem considered in this paper. 
In 01, a simple heuristics is provided to find suboptimal 
solutions to the attack construction problem. This heuristics, 
however, might not be sufficiently accurate. 0, IfTTl . Il23l is 
most closely related to the current work. The distinctions will 
be elaborated in Section IIV-BI 

The main conclusion of this paper is that basis pursuit 
(i.e., li relaxation) can indeed solve the data attack con- 
struction problem exactly, under the assumption on the net- 
work metering system that no injection measurements are 
metered. The limitations of this assumption will be discussed 
in Section IIV-AI In fact, the main result identifies a class 
of cardinality minimization problems where basis pursuit can 
provide exact optimal solutions. This class of problems include 
as a special case the considered data attack construction 
problem, under the assumption above. 

Outline: Section HI] describes the state estimation model 
and introduces the optimization problems considered in this 
paper. Section [HI] describes the main results of this paper - the 
solution to the considered optimization problems. Section [TV] 
compares the proposed result to related works. Section [V] 
provides the proof the proposed main results. Section [VI] 
numerically demonstrates the advantages of the proposed 
results. 



II. State Estimation and Cyber-Security Analysis 
Optimization Problems 

A. Power network model and state estimation 

A power network with n + 1 buses and m a transmission 
lines can be described as a graph with n + 1 nodes and m a 
edges. The graph topology can be specified by the (directed) 
incidence matrix Bq e K(™+ 1 ) xm o ) in which the direction of 
the edges can be assigned arbitrarily. The physical property 
of the network is described by a nonsingular diagonal matrix 
D 6 M miXm ', whose nonzero entries are the reciprocals of 
the reactance of the transmission lines. 

The states of the network include bus voltage phase angles 
and bus voltage magnitudes, the latter of which are typically 
assumed to be constant (equal to one in the per unit system). 
In addition, since one arbitrary bus is assigned as the reference 
with zero voltage phase angle, the network states considered 
can be captured in a vector 9 e [0, 27r)™. The state estimator 
estimates the states 9 based on the measurements obtained 
from the network. Under the DC power flow model 0, 
the measurement vector, denoted as z, is related to 9 by 



z = H9 + Az, where H = 



PDB T 
QBDB T 



(1) 



In (Q~|), Az can be either a vector of random error or intentional 
additive data attack (e.g., 0). B e R" xm » is the truncated 
incidence matrix (i.e., Bq with the row corresponding to 
the reference node removed), and P consists of a subset 
of rows of an identity matrices of appropriate dimension, 
indicating which line power flow measurements are actually 
taken. Together, PDB T 9 is a vector of the power flows on the 
transmission lines to be measured. Analogously, the matrix Q 
selects the bus power injection measurements that are taken. 
QBDB T 9 is a vector of power injections at the buses to be 
measured. Therefore, H is the measurement matrix, relating 
the measured power quantities to the network states. The 
number of rows of H is denoted m. 

The measurements z and the network information H are 
jointly used to find an estimate of the network states denoted 
as 9. Assuming that the network is observable, it is well- 
established that the state estimate can be obtained using the 
weighted least squares approach [U Chapter 5] J21 Chapter 8]: 



6 = (H T WH) l WH T z, 



(2) 



where IF is a positive definite diagonal weighting matrix, 
typically weighting more on the more accurate measurements. 
The state estimate 9 is subsequently fed to other vital SCADA 
functionalities such as OPF dispatch and CA. Therefore, the 
accuracy and reliability of 9 is of paramount concern. 

To detect possible faults in the measurements z, the BDD 
test is commonly performed (see 0, 0). In one typical 
strategy, if the norm of the residual 

residual = z - H0 = (I - H(H T WH) 1 WH T )Az (3) 
is too big, then the BDD alarm will be triggered. 
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B. Unobservable data attack and security index 

The BDD test is in general sufficient to detect the presence 
of Az if it contains a single random error (TJ, 0. However, in 
face of a coordinated malicious data attack on multiple mea- 
surements the BDD test can fail. In particular, [3| considers 
unobservable attack of the form 



Az = HA9 



(4) 



for an arbitrary AO G R". Since Az as defined in (0 would 
result in a zero residual in (0, it is unobservable from the BDD 
perspective. This was also experimentally verified in [14| in a 
realistic SCADA system testbed. To quantify the vulnerability 
of a network to unobservable attacks, |@] introduced the notion 
of security index for an arbitrarily specified measurement. The 
security index is the optimal objective value of the following 
cardinality minimization problem: 



minimize 



\HA6\\ 



subject to H(k, :)A9 = 1, 



(5) 



where k is given, indicating that the security index is computed 
for measurement k. The symbol || • || denotes the cardinality 
of a vector and H(k, :) denotes the fc th row of H. The 
security index is the minimum number of measurements an 
attacker needs to compromise in order to attack measurement 
k undetected. In particular, a small security index for a 
particular measurement k means that in order to compromise k 
undetected it is necessary to compromise only a small number 
of additional measurements. This can imply that measurement 
k is relatively easy to compromise in an unobservable attack. 
As a result, the knowledge of the security indices allows the 
network operator to pinpoint the security vulnerabilities of 
the network, and to better protect the network with limited 
resource. To model the case where certain measurements are 
protected (hence cannot be attacked) 0, [5|, |6|, problem (0 
becomes: 



minimize 

A6»eK" 



\\HA6\\ Q 

subject to H(k, :)A9 = 1 
H(I, :)A9 = 0, 



(6) 



where the protection index set I C {1,2,..., m} is given, 
H(X,:) denotes a submatrix of H with rows indexed by I. 
By convention, the constraint H(I, :)A9 = is ignored when 
1 = 0. Hence, (0 is a special case of (0. 

C. Measurement set robustness analysis 

Problem <(5j is also motivated from another important state 
estimation analysis problem, namely observability analysis (TJ, 
0. The measurement set, described by H in (0, is observable 
if 9 can be uniquely determined by (0. An important question 
of observability analysis is as follows: 



minimize 

J 



\\J\\ Q 

subject to iaxik(H(J , :)) < n 
rank(iJ(Ju{fc},:)) 
kej. 



(7) 



In above, k is a given index and J denotes the complement 
of J (for index set 2" in the rest of the paper, 2" denotes its 
complement). The meaning of (0 is as follows: J denotes 
a subset of measurements from the measurement system 
described by H. The condition that rank(_ff {J , :)) < n 
means that the measurement system becomes unobservable 
if the measurements associated with J are lost. That is, it 
becomes impossible to uniquely determine 9 from H(J , :). 
The problem in seeks the minimum cardinality J which 
must include a particular given measurement k. Therefore, if 
there exist a measurement k which leads to an instance of 
with a very small objective value, then the measurement 
system is not robust against meter failure. Special cases of (0 
have been extensively studied in the power system community. 
For instance, the solution label sets of cardinalities one and 
two are, respectively, referred to as critical measurements and 
critical sets containing measurement k [1|. Their calculations 
have been documented in, for example, (TJ, fl5l -l 181. For the 
more general cases where the minimum cardinality is p > 2, 
the solution label set in is a critical p-tuple which contains 
the specified measurement k ifTTI . 11191 . Solving (0 solves (0 
as well. The justification is given by the following statement 
inspired by [7], and proved in Appendix: 

Proposition 2.1: Let H E R mxn and k G {1, 2, . . . , m} be 
given for problems (0 and (0. Denote the two conditions: 

I: H(k, :) ^ 0. 

II: H has full column rank {— n). 

The following three statements are true: 

(a) Problem (0 is feasible if and only if condition I is 
satisfied. 

(b) Problem (0 is feasible if and only if conditions I and 
II are satisfied. 

(c) If conditions I and II are satisfied, then (0 and (0 
are equivalent (see Definition Q] in Section [V- Al l. 

Note that if condition I is not satisfied, then the corresponding 
measurement k should be removed from consideration. Also, 
since measurement redundancy is a common practice in power 
networks (TJ, 0, H can be assumed to have full column rank 
(= n). Therefore, conditions I and II in Proposition ^. ll can be 
justified in practice. Finally, note that Proposition [2J] remains 
true for arbitrary matrix H (not necessarily defined by (0). 



III. Problem Statement and Main Result 

A. Problem statement 

As discussed previously, this paper proposes an efficient 
solution to the security index (i.e., attack construction) prob- 
lem in (O. However, the proposed result focuses only on a 
generalization of a special case of (0. In this special case, H 
in (0 does not contain injection measurements: 



H = PDB 



(8) 



The limitation of the assumption in (0 will be discussed in 
Section IIV-AI after the main result is presented. 
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In the Appendix, it is shown that the special case of (0 
with the assumption in ((H) is equivalent to 

minimize \\P(X -.)B T A9\\ n 

subject to P(k, :)B T A8 = 1 (9) 
P(X, :)B T A9 = 0. 

Instead of considering (O directly, the proposed result pertains 
to a more general optimization problem associated with a 
totally unimodular matrix (i.e., the determinant of every square 
submatrix is either —1, 0, or 1 l20l ). In particular, the 
following problem is the main focus of this paper: 

minimize \\A (I, :)x L 
subject to A(k, :)x = 1 (10) 
A(X,:)x = 0, 

where A g jg m ><™ j s a given totally unimodular matrix, and 
k £ {1, 2, . . . , m} and I C {1, 2, . . . , ra] are given. Since B 
in (0 is an incidence matrix, PB T is a totally unimodular 
matrix. Therefore, ( fTOt is a generalization of (|5). However, 
neither ( TT~0b nor (|6]l includes each other as special cases. 



C. Statement of main result 

Theorem 3.1: Let {x*,, x*_ , y* ) be an optimal basic 
feasible solution to (fT2l . where A, fc and I are defined in 
([Tol l. Then x* = (x*L — x*_) is an optimal solution to (TTOb . 

Remark 3.1: Theorem 13.11 provides a complete procedure 
for solving ( TTOb via (fT2l . If the standard form LP problem 
in (IT2b is feasible, then it contains at least one basic feasible 
solution (see the definition in Section IV-Ab . Together with 
the fact that the objective value is bounded from below (e.g., 
by zero), [22, Theorem 2.8, p. 66] implies that problem (fLZb 
contains at least one optimal basic feasible solution, which can 
be used to construct an optimal solution to (TlOb according to 
Theorem 13. II Conversely, if the feasible set of dTzb is empty, 
then the feasible set of (TlOb must also be empty because a 
feasible solution to (fT0T > can be used to construct a feasible 
solution to dl2l . 

Remark 3.2: To ensure that an optimal basic feasible solu- 
tion to ( fT2l is found if one exists, the simplex method (e.g., 
11221 Chapter 3]) can be used to solve (fT2l . 

The proof of Theorem l3.1l will be given in Section[V] Before 
that, the related work are reviewed, and the assumption in (© 
is discussed. 



B. l\ relaxation 

Problem (IToT > is a cardinality minimization problem. In 
general, no efficient algorithms have been found for solving 
cardinality minimization problems lETl . so heuristic or relax- 
ation based algorithms are often considered. The l\ relaxation 
(i.e., basis pursuit |13|) is a relaxation technique which has 
received much attention. In l\ relaxation, instead of ([Tol l, the 
following optimization problem is set up and solved: 



minimize 



subject to A(k, :)x = 1 
A(X,;)x = 0, 



(11) 



where in the objective function in (fTTT i the vector 1-norm 
replaces the cardinality in ( TTOb . Problem (fTTT i can be rewritten 
as a linear programming (LP) problem in standard form 
pp. 4-6, p.17]: 



minimize 

x+,x-,y + ,y- 

subject to 



E (y+(j) + y-(j)) 

A(I,:)(x+ -x-)=y+-y- 
A(k, :)(x + -x-) = l 
A(X,;)(x + -X-) = 



x- e 



y+ e 



if 



y- e 



(12) 

where \1\ denotes the cardinality of the index set X. 

If (x+, x-,y+,y~) is a feasible solution to (fT2l . then 
x = £+ — X- is feasible to ( TTOb - Hence, an optimal solution 
to ( TT2l . if it exists, corresponds to a suboptimal solution to 
the original problem in (TTOb . An important question is under 
what conditions this suboptimal solution is actually optimal to 
( [Tol l. An answer is provided by our main result, based on the 
special structure in (TlOb and the fact that matrix A is totally 
unimodular. 



IV. Related Work 

A. Rationale of the no injection assumption in (0 

Consider the case of (0 where I corresponds only to line 
power flow measurements, then with the definition of H in 
([TJ it can be verified that (|6]l is equivalent to the following: 

minimize \\P(X, :)B T A9L + \\QB T DBA6\\ n 

subject to P(k, :)B T A9 = 1 (13) 
P(X, :)B T A0 = 0. 

This indicates that the considered problem in © is a re- 
laxation ||22l | of the general case in (fT3l . ||8) utilizes this 
observation and obtains satisfactory suboptimal solution to 
(O. Alternatively, ifTTl considers indirectly accounting for 
the term ||<3i? T £)i?A(9|| in the objective function of ( fT3l . 
IfTTl demonstrates that solving the following problem provides 
satisfactory suboptimal solution to ( fT3l 



minimize 

A6»eR™ 



P(I, :)B T AO 

subject to P(k, :)B T A6 = 1 
P(X, :)B T A6 = 0, 



(14) 



with appropriately defined P, I and k. Notice that (fT4l > has the 
same form as In conclusion, the "no injection assumption" 
in dHJ which leads to (0 introduces limitation, but it need 
not be as restrictive as it might appear. The proposed result 
in Theorem 13.11 still leads to a LP based approach to obtain 
suboptimal solutions to (fT3l (and hence ©). 

B. Relationship with minimum cut based results 

Nevertheless, the main strength of the current result lies 
in the fact that it solves problem ( TTOb where the A matrix 
is totally unimodular. ( TTOb includes @ as a special case 



5 



where the corresponding constraint matrix is a transposed 
graph incidence matrix. This distinguishes the current work 
with other ones such as (8J, ifTTl . 11231 which specialize in 
solving (0 using graph-based minimum cut algorithms (e.g., 
[24-1). One example of A which is totally unimodular but 
not associated with a graph is the matrix with consecutive 
ones property (i.e., if either for each row or for each column, 
the l's appear consecutively) [25 1. For a possible application, 
consider a networked control system l26l . Il27l with one 
controller and n sensor nodes. Each node contains a scalar 
state value, constant over a period of m time slots. The nodes 
need to transmit their state values through a shared channel 
to the controller. Each node can keep transmitting over an 
arbitrary period of consecutive time slots. At each time slot, 
the measurement transmitted to the controller is the sum of 
the state values of all transmitting nodes. Denote z £ K" 1 as 
the vector of measurements transmitted over all time slots, 
and 9 £ M™ as the vector of node state values. Then the 
measurements and the states are related by z = Ad, where 
A £ Rmxn j s a (o,l) matrix with consecutive ones in the 
each column. Solving the observability problem in (0 with 
H = A can identify the vulnerable measurement slots, which 
should have higher priority in communication. 

C. Relationship with compressed sensing type results 

Problem (TTOt can be written in a form more common in 
the literature. Consider only the case where the null space of 
A T is not empty (otherwise rank(A) = m and (TTOb is trivial). 
With a change of decision variable z = Ax, ( TTOb can be posed 
as: 

minimize II z (X) II n 

subject to Lz = 

z(X) = 
z{k) = 1, 



(15) 



where L has full rank and LA — 0, and z(X) denotes a 
sub-vector of z containing the entries corresponding to the 
index set X. (Q3) can be written as the cardinality minimization 
problem considered, for instance, in ll2D . [28|-|30|: 



minimize 



(16) 



subject to 

with appropriately defined matrix $ and vector b. In this 
subsection, we restrict the discussion to the standard case. 
That is, (IToT i is feasible and $ is a full rank matrix with 
more columns than rows. As ( TToT l is well-studied, certain 
conditions regarding when its optimal solution can be obtained 
by l\ relaxation are known. For example, 11281 . iTJTl report 
a sufficient condition based on mutual coherence, which is 
denoted as and defined as 



M ($) 



m,iY<S>(:,3)\ 

<)ll a ll*0.i)ll 



(17) 



The sufficient condition 1 30 1 states that if there exists a feasible 
solution z in ( TToT l which is sparse enough: 

1 / 1 



then z is the unique optimal solution to ( TToT l and its li relax- 
ation (i.e., problem ( TToT l with ||5|| x replacing ||2|| ). Another 
well-known sufficient condition is based on the restricted 
isometry property (RIP) I2T1 . 11321 . For any integer s, the RIP 
constant S s of matrix $ is the smallest number satisfying 

(l-6 s )\\x\\l<\\<Z>xf 2 <(l + S s )\\xf 2 (19) 

for all vector x such that ||x|| < s. The RIP-based sufficient 
condition [32] states that if for some s, <£> has a RIP constant 
82s < v2 — 1, then any z satisfying $z = b and \\z\\ < s 
is necessarily the unique optimal solution to both ( TToT l and 
its l\ relaxation. It has been shown that certain type of 
randomly generated matrices satisfy the above conditions with 
overwhelming probabilities (e.g., 1211 provides a RIP-related 
result). However, the above conditions might not apply to ( TTOl . 
which is the focus of this paper. For instance, consider A in 
([Tol l being a submatrix of the transpose of the incidence matrix 
of the 6-bus power network from [33 1: 



A = 



"1 


-1 

















1 


-1 




















1 


-1 

















1 


-1 


1 








-1 











1 








-1 











1 








-1 



Let k 
b are 





"1 


1 


-1 


-1 


-1 





f 




"0" 


$ = 


1 





-1 





-1 


1 





b = 






















1 







1 



l*llo<2 



1 



(18) 



6, and X = 0. Then the corresponding $ in ( TToT l and 



(20) 



For this <£>, (TTTl i implies that /i( < I > ) = 1. Therefore, the sparsity 
bound in (TT8l becomes ||z|| < 1. This is too restrictive 
to be practical. Similarly, for all s > 1, the RIP constants 
82s are at least one because $(:, 1) = — $(:,3). Hence the 
RIP-based sufficient condition would not be applicable either. 
Nevertheless, the failure to apply these sufficient conditions 
here does not mean that it is impossible to show that l\ 
relaxation can exactly solve (TToT l. The mutual coherence and 
RIP-based conditions characterize when a unique optimal 
solution exists for both (TToT l and its l\ relaxation, while in 
this paper uniqueness is not required. Indeed, for (TToT l with $ 
and b defined in©, both [-1 -1 1 0] T and 
[-1 1 1 0] are optimal (this can be verified 
by inspection). Using the CPLEX LP solver iTJU in MATLAB 
to solve the l\ relaxation leads to the first optimal solution. 
It is the main contribution of this paper to show that this is 
the case in general when ( TToT l is defined by ( TTOl i. even though 
the optimal solution might not be unique. The reason why the 
proposed result is applicable is that it is based on a polyhedral 
combinatorics argument, which is different from those of the 
mutual coherence and RIP based results. 

V. Proof of the Main Result 
A. Definitions 

The proof requires the following definitions: 
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Definition 1: Two optimization problems are equivalent 
if there is an one-to-one correspondence of their instances. 
The corresponding instances either are both infeasible, both 
unbounded or both have optimal solutions. In the last case, 
it is possible to construct an optimal solution to one problem 
from an optimal solution to the other problem and vice versa. 
In addition, the two problems have the same optimal objective 
value. 

Definition 2: A polyhedron in W is a subset of R p 
described by linear equality and inequality constraints. A 
standard form polyhedron (as associated with a standard form 
LP problem instance) is specified by {9 \ CO = d, 9 > 0} for 
some given matrix C and vector d. 

Definition 3: A basic solution ll22l p. 50] of a polyhedron 
in W p is a vector satisfying all equality constraints. In addition, 
out of all active constraints p of them are linearly independent. 
For a standard form polyhedron with a constraint matrix of full 
row rank, basic solutions can alternatively be defined by the 
following statement [22 p. 53]: 

Theorem 5.1: Consider a polyhedron {9 \ C9 = d, 9 > 0}, 
and assume that C £ M. lxp and C has full row rank. A vector 
9 is a basic solution if and only if C9 = d and there exists an 
index set J C {1, 2, . . . ,£>}, with \ J\ = I, such that det(C(: 
,J))^0 and 9(i) = if 

Definition 4: A basic feasible solution ([22j p. 50] of a 
polyhedron is a basic solution which is also feasible. By 
convention, the terminology "a basic feasible solution to a LP 
problem instance" should be understood as a basic feasible 
solution of the polyhedron which defines the feasible set of 
the instance. 



B. Proof 

Two lemmas, key to the proof, are presented first. The first 
lemma states that problem dT2b . as set up by l\ relaxation, has 
integer-valued optimal basic feasible solutions. 

Lemma 5.1: Let {x*,, x*_, y+, y*_) be an optimal basic fea- 
sible solution to (fT2l >. Then it holds that x*(i) = {x*L{i) ~ 
x±(i)) G {-1,0,1} for all 1 < i < n. In addition, 
y*(j) 4 \A{X(j),:)x*\ G {0, l}_for all 1 < j < \X\, where 
X(j) denotes the j th element of X, 

Proof: Assume that the feasible set of ( TT~2T > is nonempty, 
otherwise there is no basic feasible solution (cf. Definition 0). 
The following two claims are made: 

(a) A(k, :) cannot be a linear combination of the rows 
of A(X,:). 

(b) There exists X' C X such that either X' = or the 
rows of A(X', :) are linearly independent. In addition, 
in both cases A(X', :)9 = and A(X, :)9 = define 
the same constraints. 

Claims (a) and (b) together imply that problem (fT2l can be 
written as a standard form LP problem with a constraint matrix 
with full row rank (i.e., matrix C below): 



with 



minimize f T 9 
e J 

subject to C9 = d 

9>0, 



(21) 



C = 



d^ 



A(X, 
A(X' , 
A(k,: 



-A(X, :) 
-A(X',:) 



-I\z\ %| 



-A(k,:) 








x+ 




Onxl 




X- 

y+ 


f = 


n xl 
l|X|xl 


1 


y-. 




.- l -|I|xl- 





where 



(22) 



and 1 is a 



is an identity matrix of dimension \X\. 
vector of all ones. 

To see the claims, first note that (a) is implied by the 
feasibility of (0. For (b), If X = or A(X, :) = 0, then 
set X' = 0. Otherwise, there exists X' C X with the properties 
that \X'\ = rank(j4(X, :)), A(X',:) has linearly independent 
rows and A(X, :) = SA(X', :) for some matrix S. On the 
other hand, A(X' , :) = S'A(X, :) for some matrix S', because 
X' C X. Hence, A(X, :)9 = and A(X', :)9 = define the 
same constraints. This shows (b). 

The next step of the proof is to show that every basic 
solution of d2~n > has its entries being either —1, or 1. Denote 
the matrix B\ as the first 2n columns of C, and let B\ 
be any square submatrix of B\. If B\ has two columns (or 
rows) which are the same or negative of each other, then 
det(-Bi) = 0. Otherwise, B\ is a (possibly row and/or column 
permuted) square submatrix of A, and A is assumed to be 
totally unimodular. Hence, det(Si) G { — 1,0,1} and B\ is 
totally unimodular. Next consider the matrix B defined as 









\A(I,:) 




-A(X,:) 


-%l 


%l 


0" 


4 [c d] 




A(I',:) 




-A(X',:) 

















_A(k,:) 




-A(k,:) 








1 










0" 












Bi 





























1 











Denote the number of rows and the number of columns of B as 
tub and riB respectively. Let J C {1, 2, . . . , tib} be any set 
of column indices of B such that | J\ = tub (so that B(:,J) is 
square). If £>(:, J) contains only columns of B\, then det(£>(: 
,<J)) G { — 1,0, 1} since B\ is totally unimodular. Otherwise, 
by repeatedly applying Laplace expansion on the columns of 
B(:,J) which are not columns of B\, it can be shown that 
det(-B(:, J)) is equal to the determinant of a square submatrix 
of B\, which can only be —1, or 1. Hence, by Cramer's rule 
the following holds: If v is the solution to the following system 
of linear equations 



B{:,J)v = B(:,n B ), J C {1, . • . , n B - 1}, 
\J\=m B , and det(B(:,J))^0, 



then 



;(j)G {-1,0,1}, Vj. 



(23) 



(24) 



Theorem 15. II and (l23l together imply that the nonzero entries 
of all basic solutions to (l2lT > are either —1, or 1. Therefore, 
the basic feasible solutions, which are also basic solutions, to 
the polyhedron in (fJTJ also satisfy this integrality property. 
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Finally, let , yi,y*) be an optimal basic feasible 

solution. Then feasibility (i.e., nonnegativity) implies that 

i* (7) e {0,1}, z*(j)G{0,i}, 
240') g {0,1}, y*(j)e{0,i}, Vj. 

The minimization excludes the possibility that, at optimality, 
yi(j') = y* (j) = 1. Hence, it is possible to define x* and 
such that 

a;*(i) = -x*_(i)) G {-1,0,1} Vi 

y*0') = (v+Ci) + 2/-0')) = *1 e {0, 1} v j. 

(26) 
■ 

The second lemma is concerned with a restricted version of 
( [Tol l with an infinity norm bound as follows: 



minimize 



subject to 



\\A(i,:)x\\ Q 

A(k,:) x = 1 
A(l, :)x = 
IIAxll < 1. 



(27) 



Lemma 5.2: Optimization problems (TTOb and ( f27l > are 
equivalent. 

Proof: Suppose ( fTOb is feasible, then it has an optimal 
solution denoted as x* . Let Z x * C X be the row index set such 
that A(j, :)x* 7^ if and only if j G 2^*. Then it is claimed 
that there exists a common optimal solution to both (fTOb and 
(l2Tb with the same optimal objective value. The argument 
is as follows. The property of x* implies the feasibility of 
(fl2]) . which is denoted as a variant of (TTZt with I replaced 
by I x *. By l22l Corollary 2.2, p. 65], problem (|12p . as a 
standard form LP problem, has at least one basic feasible 
solution. Furthermore, since the optimal objective value of 
(|12p is bounded from below (e.g., by zero), Il22l Theorem 2.8, 
p. 66] implies that (fT2"|) has an optimal basic feasible solu- 
tion (x*L , x*_ , yl, y*_ ) which is integer- valued as specified by 
Lemma IBTTl Denote x* = (x*, — x*_ ). Then x* is feasible to 
both ([TOll and ([27) since %.* C X, :)x*| G {0, l}' 1 ** 

and fc G X x *. Also, ||A(X, :)x*|| Q = || A^* , :)x* || Q < 
||A(2a;*, :)x*|| = || A(I, :)x* || , as the inequality is true 
because x* is an optimal solution to (TT2"1) '. Hence x* is optimal 
to both ([Tol l and dZTl i. with the same objective value. 

Conversely, suppose (TTOb is infeasible, then (l27l i is also 
infeasible. This concludes that (TTOb and (ffTJi are equivalent. 



Proof of Theorem \3.1\ Let [x* + , 2:* , y * , y* ) be an 
optimal basic feasible solution to dT2T> . Then there exist x* and 
as defined in Lemma IBTTI In particular, x* = (x*j_ — x*_ ). 
It can be verified that (x*,y*) is an optimal solution to the 
following optimization problem: 



minimize 
subject to 



|2| 

EyW 

•7=1 

A(I,:)a:<y 
-A(l,:)x<y 
A(k,:)x = 1 
A(X, :)x = 
< y(j) < l 



where the inequalities above hold entry-wise. Because of the 
property that y*(j) G {0,1} for all j, (x*,y*) is also an 
optimal solution to 



minimize 
subject to 



m 

Ew(<) 

i=i 

A(X, :)x < y 
-A(J, :)x < y (28) 
A(fc,:)a; = 1 
A(X, :)x = 

y(i)G{0,l} Vi = 1,2, . . . , |X|. 

It can be verified that (l28l is equivalent to (|27| >. Then 
Lemma IBT21 states that d28l is also equivalent to (fTOb . Conse- 
quently, (x*,y*) being an optimal solution to (1281 implies that 

m 

(fTOb is feasible with optimal objective value being V*{j)- A 

i =1 

feasible solution to (TTOb is x*. Since y*(j) = |A(T(j), :)x*| G 

|Xj 

{0, 1} Vj, it holds that ||A(Z, :)x*|| Q = £ y*(j). Hence, 
is an optimal solution to (TTOb . 



VI. Numerical Demonstration 

As a demonstration, instances of the restricted security index 
problem in (O are solved with P being an identity matrix 
and I being empty. The incidence matrix B describes the 
topology of one of the following benchmark systems: IEEE 
14-bus, IEEE 57-bus, IEEE 1 18-bus, IEEE 300-bus and Polish 
2383-bus and Polish 2736-bus ll35ll . For each benchmark, (0 is 
solved for all possible values of k (e.g., 186 choices in the 1 18- 
bus case and 411 choices in the 300-bus case). Two solution 
approaches are tested. The first approach is the one proposed. 
It is denoted the li approach, and includes the following steps: 

1) Set up the LP problem in ([12]) with A being B T . 

2) Solve (22) using a LP solver (e.g., CPLEX LP). Let 
(x+ , x*_ , y+ , y*_ ) be its optimal solution. 

3) Define A9* = x* + — x*_ . It is the optimal solution to Q, 
according to Theorem 13. II 

The second solution approach to (O is standard, and it was 
applied also in (8), ifTTI . This second approach is referred to as 
lo approach, as (O is formulated into the following problem: 



minimize 

AS, y 

subject to 



B T A6 < My 

-B T A6 < My 

B{:,k) T A6 = 1 

y(j) G {0,1} Vj, 



(29) 



Vj = l,2,. 



HI 



where M is a constant required to be at least 1 1 B T 1 1 = 1 1 B \ \ 1 
(i.e., the maximum column sum of the absolute values of the 
entries of B) J8J. Because of the binary decision variables in 
y, d29l ) is a mixed integer linear programming (MILP) problem 
ll22ll . It can be solved by a standard solver such as CPLEX. 
The correctness of the Iq approach is a direct consequence 
that ( f29b is a reformulation of (O. As a result, both the l\ 
and Iq approaches are guaranteed to correctly solve (O by 
theory. Fig. [2] shows the sorted security indices (i.e., optimal 
objective values of (O) for the four larger benchmark systems. 
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The security indices are computed using the h approach. As a 
comparison, the security indices are also computed using the 
lo approach, and they are shown in Fig. [3] The two figures 
reaffirm the theory that the proposed li approach computes 
the security indices exactly. Fig. [2] (or Fig. [3) indicates that 
the measurement systems are relatively insecure, as there exist 
many measurements with very low security indices (i.e., equal 
to 1 or 2). 



10 J 
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0) 5 
g 

4 

'i_ 

=l 
o 

<X> 3 

03 
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Fig. 2: Security indices using the l\ approach 
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Fig. 3: Security indices using the Iq approach 

In terms of computation time performances, it is well-known 
that the Iq approach is much more time-consuming than the 
l\ approach since a MILP problem is much more difficult 
to solve than a LP problem of the same size ll22l . Fig. [4] 
shows the solve-time for computing all security indices for 
each benchmark system, using the l\ and Iq approaches. It 
verifies that the proposed 1% approach is more effective. In the 
above illustration, all computations are performed on a dual- 
core Windows machine with 2.4GHz CPU and 2GB of RAM. 



S 10° 



10 



10 



O 10 approach 
■ X - 11 approach 




3 4 

case number 



Fig. 4: Solve-time for computing all security indices for 
different benchmark systems 



VII. Conclusion 

The cardinality minimization problem is important but in 
general difficult to solve. An example is shown in this paper 
as the smart grid security index problem in ©. The li 
relaxation has demonstrated promise but to establish the cases 
where it provides exact solutions is non-trivial. Well-known 
results based on mutual coherence and RIP provide sufficient 
conditions under which a unique optimal solution solves both 
the cardinality minimization problem and its li relaxation. 
However, this paper identifies a class of application motivated 
problems (as in ( [Tol l) which can be shown to be solvable by l\ 
relaxation, even though results based on mutual coherence and 
RIP cannot make the assertion. In fact, the optimal solution to 
([Tol l might not be unique. The key property that leads to the 
conclusion of this paper is total unimodularity of the constraint 
matrix. The total unimodularity of matrix A in (fT0T > leads to 
two important consequences. (TToT > is equivalent to its oo-norm 
restricted version in ( f27l i. Furthermore, (|27| | can be solved 
exactly by solving the LP problem in (fT2l . thus establishing 
the conclusion that li relaxation exactly solves ( TTOb . 

Appendix 

A. Proof of the equivalence between (|6j and (0 

Note that the constraint H(l, :)A9 = implies that 
||.ffAf5|| = \\H(T,:)Ae\\ . Since P consists of rows of an 
identity matrix and D is diagonal and nonsingular, for all 
J C {1, 2, . . . , m}, there exists a diagonal and nonsingular 
matrix Dj such that P(J,:)D = DjP{J,:). In particular, 
let Dkk be a positive scalar such that P(k, :)D = DkkP(k, : 
) = P(k, :)Dkk- The above implies that for all A9, 

P(k, :)B T A8 = 1 if and only if 

P(k, -)D kk B T {D dd - l A6) = P(k, :)DB T (D dd - 1 A9) = 1. 
In addition, for all A9 

P(X, :)B T A6 = if and only if 

Duk^DxPiX, :)B T AO = P(X, :)DB T (D^ 1 A0) = 0. 
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Finally, for all A6 

\\P(2,:)B T A9\\ Q = \\D kk - l D t P{l,-)B T A9\\ Q 
= \\P(l,:)DB T (D dd - 1 Ae)\\ . 

Applying the definition of H in and a change of decision 
variable to Dkk~ 1 A9 shows that (0 and (0 are equivalent. 

B. Proof of Proposition 12.71 
Part (a) is trivial. 

For the necessary part of (b), condition I is necessary 
because if H{k, :) = then rimk(H(J, :)) = r&nk(H(J U 
{k}, :)) for all J (meaning that is infeasible). Condition 
II is also necessary because if rank(_ff ) < n. then there does 
not exist any J such that iank(H(J ', :)) = n. 

For the sufficiency part of (b), assume that conditions I 
and II are satisfied. Then by part (a) problem 01 is feasible. 
Hence it has an optimal solution denoted as 9*. Define Jg* C 
{1, 2, . . . , m} such that p € Jg* if and only if H (p, :)9* ^ 0. 
By definition of Jg*, rank(iJ( Jg* , :)) < n. Also, k e Jg* 
because H(k, :)6* = 1. If rank(ff(^7 U {k}, :)) = n, then 
Jg* is feasible to 01, thus showing that (0 is feasible. To 
show this, first consider the case when ||7J0*|| O = 1. Then 
Jg* = {k} and rank(# \Jg~* U {k}, :)) = rank(iJ) = n 
because of condition II (i.e., H has full column rank). Next 
consider the case when ||iI0*|| o > 1 (i.e., \ Jg* \ {fc}| > 0). If 
r&nk(H(Jg* U {&},:)) < n, then there exists 9^0 such 
that U{jg~* U {k},:)9 = 0. In particular, H(k,:)9 = 0. 
Also, condition II implies that H(Jg* \ {k},:)9 ^ (since 
otherwise H9 = 0). Let q G Je* \ {k} such that :)0 ^ 0. 
Note also that by definition of Jg*, H(q, :)9* ^ 0. Construct 
9' = (H(q, :)9)9* - (H(q, :)9*)9. Then H(k, :)9' = 1, 
H{p, :)9' = whenever H(p, :)9* = 0, but H(q, :)9' = 
while H(q, :)9* ^ 0. This implies that 9' is feasible to with 
a strictly less objective value than that of 9* , contradicting the 
optimality of 9* . Therefore, the claim that iank(H(Jg* U{fc}, : 
)) = n is true. This implies that Jg* is feasible to 0, 
establishing the sufficiency of part (b). 

For part (c), under conditions I and II both (0 and (0 
are feasible. In addition, Jg* constructed in the proof of 
the sufficiency part of (b) satisfies 1,70*1 = ||-ff$*Ho> f° r ®* 
being an optimal solution to 0. This means that the optimal 
objective function value of (0 is less than or equal to that of 
0. For the converse, suppose that J* is optimal to 0, then 
the feasibility of J* implies that there exists 9j* ^ such that 
H{J*,:)9j* = 0. This also implies that ||-H"^*|| < \ J*\. 
If H(k,:)9j* = 0, then H(J* U {k},:)9j* = 0. This 
implies that Taxik(H(J* U {k},:)) < n, contradicting the 
feasibility of J* . Therefore, there exists a scalar a such that 
H(k, :)(a9j*) = 1. Consequently, a9j* is feasible to (0 with 
an objective function value less than or equal to the optimal 
objective function value of 0. 
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